Fly 6:4, 284-289; October/November/December 201 2; © 201 2 Landes Bioscience 

Non-destructive species identification 
of Drosophila obscura and D. subobscura (Diptera) 
using near-infrared spectroscopy 

Stefanie Fischnaller, 1 '* Floyd E. Dowell, 2 Alexandra Lusser, 3 Birgit C. Schlick-Steiner u and Florian M. Steiner u 

'Molecular Ecology Group; Institute of Ecology; University of Innsbruck; Innsbruck, Austria; 2 United States Department of Agriculture; Agricultural Research Service; 
Manhattan, KS USA; department of Molecular Biology; Biocenter; Innsbruck Medical University; Innsbruck, Austria 

+ These authors contributed equally to this work. 

Keywords: Drosophila obscura, Drosophila subobscura, species identification, near-infrared spectroscopy, cuticular hydrocarbons, 

laboratory adaptation 

Abbreviations: CHCs, cuticular hydrocarbons; NIRS, near-infrared spectroscopy; PLS, partial least squares; Wm, field-collected 
males; Fl, first lab-reared generation; F8, eighth lab-reared generation; r 2 , coefficient of determination; 

SECV, standard error of cross validation 



The vinegar flies Drosophila subobscura and D. obscura frequently serve as study organisms for evolutionary biology. 
Their high morphological similarity renders traditional species determination difficult, especially when living specimens 
for setting up laboratory populations need to be identified. Here we test the usefulness of cuticular chemical profiles 
collected via the non-invasive method near-infrared spectroscopy for discriminating live individuals of the two species. 
We find a classification success for wild-caught specimens of 85%. The species specificity of the chemical profiles persists 
in laboratory offspring (87-92% success). Thus, we conclude that the cuticular chemistry is genetically determined, 
despite changes in the cuticular fingerprints, which we interpret as due to laboratory adaptation, genetic drift and/or diet 
changes. However, because of these changes, laboratory-reared specimens should not be used to predict the species- 
membership of wild-caught individuals, and vice versa. Finally, we demonstrate that by applying an appropriate cut-off 
value for interpreting the prediction values, the classification success can be immensely improved (to up to 99%), albeit 
at the cost of excluding a considerable portion of specimens from identification. 



Introduction 

Drosophila obscura and D. subobscura (Diptera: Drosophilidae) 
are closely related species of the D. obscura group, 1 with a wide 
distribution in the Palaearctic. Both are generalists and co-occur 
broadly in the colline and alpine zone. 2 They are frequently used 
species in evolutionary-biological studies (for review see refs. 
3-7). Accurate species identification of living specimens of both 
sexes is difficult, as the two species are morphologically highly 
similar, 8 with considerable intraspecific variation in the diagnos- 
tic characters. 9 The problem is aggravated by the need to keep to 
a minimum the anesthesia by C0 2 , to avoid reduced longevity 
and fecundity 10,11 For introducing wild-caught individuals to the 
laboratory with the aim to retain genetic variation, a rapid and 
non-destructive method for species identification with the poten- 
tial for high throughput would thus be desirable as an alternative 
to morphology-based methods. 

Insect cuticular layers contain complex mixtures of hydrocar- 
bons (CHCs), many of which are synthesized by the insect itself, 
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i.e., supposedly genetically determined. 12 In addition to their cen- 
tral role in the prevention of desiccation, 13 CHCs are important 
for chemical communication, for example in mate choice 14,15 and 
social behavior. 16 The idea that the bouquet of CHCs will thus be 
species specific led researchers to enquire into their usefulness in 
species identification 17,18 and various examples of success exist." We 
decided to test the usefulness of near-infrared spectroscopy (NIRS) 
to discriminate between D. subobscura and D. obscura: Previous 
studies suggested differences in desiccation resistance between the 
two species, 8 and, possibly due to a lack of interspecific mate recog- 
nition, no hybridization between them has been reported as yet, 20 
both of which may involve CHC species specificity. 

NIRS characterizes chemical patterns qualitatively and 
quantitatively based primarily on C-H, N-H, O-H stretching 
vibrations. 21 It is thus a useful tool for the characterization of 
biological material, and is, also owing to its non-destructive and 
non-invasive nature, becoming common practice in ecology 22 
and entomology. 23,24 Of relevance here, it was successfully used 
in the identification of (non-drosophilid) dipterans. 25 " 27 
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Table 1. Classification results of Drosophila subobscura and D. obscura based on PLS regression models developed from near-infrared spectra 
(500-2200 nm) 







Males 




Females 




Wild 


F1 


F8 


F1 


F8 


PLS factors 


7 


11 


9 


12 


13 


r 2 


0.57 


0.63 


0.55 


0.49 


0.43 


SECV 


0.33 


0.34 


0.34 


0.38 


0.40 


n in the validation sets: D. subobscura I D. obscura 


252/ 15 


19/24 


90/64 


27/41 


60/67 


Cross-validation results: % correctly classified 


90% 


94% 


84% 


87% 


83% 


n correctly classified (%) / n total 


226 (85%) / 267 


38 (88%) / 43 


138 (90%)/ 154 


59 (87%) / 68 


117 (92%)/ 127 


After exclusion of class values 1 .4-1 .6: n (%) 
correctly classified / n total 


213 (90%)/ 237 


36 (90%) / 40 


127 (91%)/ 139 


56 (90%) / 62 


111 (96%)/ 116 


After exclusion of class values 1 .3-1 .7: n (%) 
correctly classified / n total 


182 (95%)/ 191 


26 (90%) / 29 


106 (94%)/ 113 


47 (92%)/ 51 


96 (97%) / 99 


After exclusion of class values 1 .2-1 .8: n (%) 
correctly classified / n total 


145 (97%)/ 148 


18 (90%)/ 20 


86 (92%) / 93 


44 (94%) /47 


80 (99%) / 82 



n = number of individuals 



The objectives of this study were to determine if NIRS (1) 
can be used to discriminate living D. obscura and D. subobscura 
specimens by using multivariate chemometrics, and (2) whether 
calibration models elaborated for wild-caught specimens and 
for specimens from different laboratory-reared generations can 
be cross-applied. Cross-applicability would reduce significantly 
the effort needed for establishing the identification of specimens 
with such differing backgrounds. However, it needs to be kept in 
mind that genetic and phenotypic changes can arise from evolu- 
tion in a novel environment. 28 The CHC bouquet, in particular, 
can evolve due to changes in the ambient thermal regime and 
in diet composition 29 but can also change due to acquisition of 
hydrocarbons from food. 30 

Results 

Statistical parameters of the partial least squares (PLS) calibra- 
tion models (number of PLS factors used, coefficient of deter- 
mination (r2) and standard error of cross validation (SECV)) 
and the classification results for the validation sets are listed in 
Table 1. The calibration models had r 2 values between 0.43 and 
0.63, and SECV values between 0.33 and 0.40. The correct clas- 
sification for the validation sets ranged between 85% and 92%; 
the best prediction results were achieved for the eighth lab-reared 
generation (F8), with 90% for F8 males (F8 min) and 92% for 
F8 females (F8f). 

We then explored how the exclusion of prediction values 
around 1.5 influenced classification rate and loss of specimens, by 
symmetrically excluding values below and above 1.5, decreasing 
and increasing in steps of 0.02 to the extremes of 1.0 and 2.0, 
respectively. Exclusion of values between 1.4 and 1.6 resulted in, 
for example, the F8 females in an increase of the correct classifica- 
tion from 92% to 96% and in the exclusion of 10% of specimens 
(Fig. 1). For the other models and validation sets, the corre- 
sponding values were similar at increases of correct classification 



to 90-91% and 9-11% of individuals excluded (Table 1); for our 
data set sizes, we found this to represent an acceptable compro- 
mise across models between accuracy and number of specimens 
excluded. When values 1.30-1.70 and 1.20-1.80 were excluded, 
the success rate for F8f increased to 97% and 99%, but the por- 
tion of specimens identified dropped to 78% and 65%, respec- 
tively (Fig. 1). 

Wavelengths important for the identification of D. subobscura 
and D. obscura were identified from the PLS regression coeffi- 
cients, with wavelengths showing very high or very low coeffi- 
cients being more important. There were peaks occurring in all 
of the five calibration models and peaks that were important only 
in single models. Figure 2 shows the regression-coefficient plot 
for F8f. 

When calibration models created for one group were used to 
predict validation sets from the other groups, the classification 
success ranged from 56% to 83% (Table 2; prediction values 
between 1.4 and 1.6 excluded). 

Discussion 

Here we show that NIRS can be used to distinguish between 
Drosophila subobscura and D. obscura with an accuracy of 85% 
to 92% using PLS analysis, when using the full range of pre- 
diction values. This indicates that the composition of CHCs 
may differ between the two species. We cannot directly relate 
NIR-spectral differences to CHCs, and also the visible spec- 
tral range was relevant to successful PLS models (see further 
down), but we assume that CHC composition contributed 
significantly to species differences (compare refs. 14—19). The 
prediction results for the wild-caught flies were comparable to 
those obtained for laboratory-reared specimens, in line with 
the notion that hydrocarbon profiles are more genetically than 
environmentally determined 31 — although the two species were 
reared under the same conditions, differences in the cuticular 
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Figure I.The portion of correctly classified specimens depends on the exclusion of spectra with ambiguous prediction values, exemplified by the F8 
females. The thin line shows the increasing loss of individuals with increasing range of excluded prediction values, the bold line shows the corre- 
sponding increase in correct classification. Exclusion ranges in boxes are discussed in the text. 



profiles persisted and were detectable by NIRS. These findings 
contrast the NIRS study by Mayagaya et al. 25 who predicted 
two Anopheles species reared in the laboratory with an accuracy 
of almost 100%, and field-collected specimens with 80% accu- 
racy. Including both wild-caught flies and laboratory-reared 
flies (from all generations) in the same model did not improve 
our prediction results, the best models resulting in 82% and 
79% prediction success for females and males, respectively (S. 
Fischnaller, unpubl.). However, from the practical point of view 
of setting up breeding lines based on identification via NIRS, 
our error rates are not fatal, given that Drosophila obscura and 
D, subobscura do not hybridize. 20 Hence, no interspecific gene 
flow is expected for unintentionally heterospecific cultures, and 
the identification procedure can be repeated in consecutive 
generations. 

The lower rate of correct classification in our study as com- 
pared with the work by Rodriguez-Fernandez et al., 27 who used 
nine Diptera species, could be caused by a closer phylogenetic 
relatedness of our species as well as by our including multiple 
populations in the sample - genetic diversity was found to be 
very high across other wild populations of D. subobscura. 2 * 
Furthermore, we included individuals of all ages, and thus likely 
both unmated and mated individuals, in our calibration and vali- 
dation sets. NIRS is sensitive to the age of individuals, and thus 
used for age-grading of various insects, 25,32,33 and Everaerts et al. 34 
showed that in Drosophilidae, in both females and males, CHC 
changes occur during mating. The variation introduced by either 
or both of these effects may possibly have impeded greater success 
of our calibration procedures. 

One way to improve classification is the exclusion of speci- 
mens with prediction values around 1.5 (Table 1 and Fig. 1). 
This procedure was suggested by Sikulu et al. 26 in general, but 
to our knowledge the trade-off between increase of classifica- 
tion success and loss of specimens has not yet been explored 



in a quantitative manner. We suggest that such exploration be 
adopted as a standard procedure in NIRS species-identification 
studies. Depending on the demands for the specific project, 
researchers could thus prioritise either classification success or 
number of specimens identified in a controlled manner. Another 
way of improving accuracy with our species could be to scan 
just wings. Using the pulled-out right wings of thawed males in 
NIRS analysis enabled us to distinguish D. subobscura from D. 
obscura with 100% accuracy (n = 50 males per species; data not 
shown). This is in line with the findings from Shevtsova et al. 35 
who found high interspecific variation in the wing interference 
patterns of Drosophilidae. Scanning just wings of live specimens 
is very difficult to put into praxis, however, due to the need for 
standardised positioning of wings on the one hand and mini- 
mum C0 2 exposure of specimens at the other (S. Fischnaller, 
unpubl.). Exploring this possibility in depth remains subject to 
future exploration. 

Examination and comparison of the regression coefficient 
plots indicated that there are peaks important to species dis- 
crimination that are common to all five calibration models. The 
region around 510, 540 and 610 nm indicates that there are dif- 
ferences between the two species in the visible region, possibly 
caused by variation in cuticle thickness, bristles and/or pigmen- 
tation. 35 The region of 1,050—1,070 nm indicates vibration of 
water molecules at the third overtone, as well as occurrence of 
molecules containing N-H functional groups (ref. 36, also used 
for the interpretation of the subsequently listed wavelengths). 
Peaks at 1,370-1,390 nm (CH 2 second overtone, and water), 
1,720-1,730 nm (CH 3 first overtone), 1,810-1,840 nm and 
1,870 nm (C-H first overtone, water), and 2,140-2,180 nm 
(N-H and O-H combination bands) also contributed to all cali- 
bration models. 

Our study suggests that wild-caught specimens of our species 
should not be used to identify laboratory-reared specimens, and 
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Figure 2. PLS regression coefficient used for identifying important wavelengths for classification of Drosophila subobscura and D. obscura females from 
the F8. 



Table 2. Correct classification rate (%) for validation sets performed on 
the different calibration models to test their cross-applicability (clas- 
sification values 1.4-1.6 excluded) 



Calibration model 


Validation set 


Wild 


F1 


F8 


Wm 


90 


83 


75 


F1m 


65 


90 


77 


F8m 


67 


77 


91 


F1f 


n.a. 


90 


56 


F8f 


n.a. 


57 


96 



n.a. = not applicable 

vice versa, due to excessive failure rates (Table 2). This contrasts 
the findings of Mayagaya et al. 25 of 79% correctly classified wild- 
caught Anopheles when using models based on laboratory-reared 
individuals. Our low success rate is supported by absorption 
peaks in the regression coefficients exclusive to just one of the 
calibration models (e.g., 1,025 nm, 1,460 nm in Wm; 1,500 nm, 
2,050 nm in Fl min; 1,770 nm in Flf; 2,000 nm in F8f). In 
other words, chemical differences led to the observed generation 
specificity of the models. Toolson and Kuper-Simbron 29 reported 
for Drosophila pseudoobscura that maintenance in the laboratory 
leads to physiological and biochemical changes. They reported 
a shift in the cuticular composition even for the first generation 
of large populations reared in the laboratory, and explained it by 
changes of selective pressure and fitness advantages under novel 
environmental factors. Especially in small populations genetic 



drift can additionally increase the genetic differentiation across 
populations (D. subobscura: see refs. 28, 37). Also, hydrocarbon 
profiles can change in a non-inherited manner due to acquisi- 
tion of food-derived hydrocarbons (ant example: ref. 30). Thus, 
changes in the metabolic profiles - either due to genetic or envi- 
ronmental changes - may have altered the recorded NIRS data 
across generations, impeding the use of calibration models gener- 
ated for one generation in the others. Future research should aim 
to pinpoint potential non-inherited contributions as well as assess 
if this problem ceases in later generations which would indicate 
that it is due to rapid laboratory adaptation 5,38 or whether larger 
population sizes diminish it which would indicate that it is due to 
genetic drift (but note that our population sizes were in line with 
general practice, e.g., Fry 39 ). 

In conclusion, there are three main findings to our study: 
First, near-infrared spectroscopy proved a useful tool for the 
identification of living Drosophila flies. Second, we could not 
cross-apply models and validation sets among field-caught and 
lab-reared individuals and across generations, indicating changes 
due to laboratory adaptation, genetic drift and/or diet changes. 
Third, classification rates could be considerably improved by 
excluding prediction values around 1.5, suggesting that research- 
ers should consider excluding a particular range of prediction 
values depending on their research question. Our study thus 
underscores the enormous potential of the NIRS technique to 
species identification (e.g., refs. 24, 25, 26, 40 and 41), and indi- 
cates that it could become an important tool also for the delimi- 
tation of species in integrative taxonomy, 42 as well as in other 
biological fields. 43 
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Table 3. Sampling data for field-collected Drosophila obscura and D. subobscura 

Number of specimens collected 
Drosophila obscura Drosophila subobscura 



Location 


Geographic coordinates 


Altitude (m a.s.l.) 


females 


males 


females 


males 


Kaserstattalm 


47°07'34.86"N 11°17'30.83"E 


2,029 


4 


27 


3 


11 


Hahntennjoch 


47°17'24.07"N 10°39'19.97"E 


1,973 


3 


0 


6 


10 


Buzihutte 


47°16'20.99"N 11°21'23.27"E 


711 


0 


0 


32 


45 


Mentlberg 


47°14'55.34"N 11°21'56.31"E 


616 


0 


11 


59 


133 


Arzl 


47 0 17'11.22"N 11°25'09.80"E 


707 


0 


3 


22 


26 


Innsbruck city 


47°15'53.43"N 11°20'34.59"E 


579 


2 


9 


24 


62 



a.s.l. = above sea level 



Materials and Methods 

Insects. Specimens were collected from six different locations in 
North Tyrol (Austria) during August and September 2010. To 
represent a wide range of habitats, the collection sites were cho- 
sen from various altitudes between 570 and 2000 min above sea 
level (Table 3). The minimum and maximum distances between 
populations were 2 and 60 km, respectively. Collecting was done 
by net sweeping over baits of fermented banana 44 in the evening 
hours from 5 to 7 p.m. The field-caught flies were transported 
alive to the laboratory and anaesthetised with CO, for morphol- 
ogy-based species identification. C0 2 exposure length for spe- 
cies identification, as well as for spectra collection (see below), 
was kept to a minimum and never exceeded four minutes per 
specimen. Flies that were identified as D. subobscura or D, obscura 
according to Bachli and Burla 9 were used to set up breeding lines 
for each location sampled. All lines were kept at a minimum cen- 
sus size of 60 individuals on an artificial diet (corn-meal, sugar, 
agar, yeast, Tegosept) and at a photoperiod of 12/12 h (light/ 
dark) at 19°C. 

Data collection. Spectra were collected from anaesthetised 
flies using a Labspec® 5000 Portable Vis/NIR Spectrometer 
(350-2,500 nm; ASD Inc.) by placing flies individually on their 
backs on a 9 cm diameter Spectralon plate. The 3 mm diameter 
bifurcated fiber-optic probe was positioned about 2 mm above the 
specimen, focusing on the abdomen. The spectrometer automati- 
cally calculated and saved the average spectrum of 30 collected 
spectra of each individual. Background reference (the baseline) 
was measured using a separate 3 cm diameter Spectralon plate to 
avoid contamination. All field-caught individuals as well as 251 
randomly chosen individuals of the Fl and 421 of the F8 of the 
breeding lines were sexed and scanned. We thus included a wide 
range of individual ages in our sample. 

Data analysis. All recorded spectra were converted into 
Galactic spectrum file format using ASD ViewSpecPro. Spectra 
used for the calibration sets were pre-processed by mean-centring 
and analyzed using PLS regression and leave-one-out cross vali- 
dation 45,46 implemented in GRAMS software PLS/IQ. Spectra 
were generally very noisy below 500 nm and above 2200 nm and 
these regions were excluded from further analysis. Calibration 
models were elaborated separately for males (m) and females 
(0, because females can be easily distinguished from males and 



because Drosophila sexes differ in their CHC-profiles. 34 We per- 
formed models for the following five groups: (1) the wild, field- 
collected males, referred to as "Wm" (due to the low number 
of field-caught D. obscura females, no model could be created 
for this group), (2) the first lab-reared generation, referred to as 
"Flm" and (3) "Flf," and (4) the eighth lab-reared generation, 
referred to as "F8m" and (5) "F8f." 

The training sets for each calibration model contained 70 spec- 
tra (35 of each species). A two-way comparison in PLS analysis 
was made by assigning integer values 1 and 2 to D. subobscura 
and D. obscura, respectively. Independent validation sets, treated 
as "unknown" specimens, were then classified on the basis of the 
calibration model in each group. Spectra predicted to have a class 
value of < 1.5 were considered to belong to D. subobscura, those 
with a predicted value of > 1.5 to D. obscura. The numbers of 
PLS regression factors to be used in the prediction models were 
determined by examining the values of the predicted residual sum 
of squares 46 and the classification results of the independent vali- 
dation sets. Accuracy of the calibration models was examined by 
checking the r 2 indicating the closeness of fit between NIRS and 
reference data, the SECV of the leave-one-out procedure, and by 
calculating the prediction results using the validation sets — the 
most rigorous indicator of model quality. 47 Spectral residuals, 
which were possibly due to technical problems such as movement 
of insufficiently anaesthetised specimens, were discarded from the 
sample. Such outliers were detected by visual examination of the 
spectra using spekwin32 (F. Menges "Spekwin32 - free software 
for optical spectroscopy"- Vers. 1.71.5, 2010, http://www.effemm2. 
de/spekwin/) and by examination of the leverage and studentised 
residuals plots generated in GRAMS (compare ref. 48). 
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